Observation points classifier ensemble for high‐dimensional imbalanced classification

Abstract

In this paper, an Observation Points Classifier Ensemble (OPCE) algorithm is proposed to deal with High-Dimensional Imbalanced Classification (HDIC) problems, based on data processed using the Multi-Dimensional Scaling (MDS) feature extraction technique. First, the dimensionality of the original imbalanced data is reduced using MDS so that the distances between any two different samples are preserved as well as possible. Second, a novel OPCE algorithm is applied to classify the imbalanced samples by placing optimised observation points in the low-dimensional data space. Third, optimization of the observation point mappings is carried out to obtain a reliable assessment of the unknown samples. Exhaustive experiments have been conducted to evaluate the feasibility, rationality, and effectiveness of the OPCE algorithm on seven benchmark HDIC data sets. Experimental results show that (1) the OPCE algorithm can be trained faster on the low-dimensional data than on the high-dimensional data; (2) the OPCE algorithm can correctly identify samples as the number of observation points is increased; and (3) statistical analysis reveals that OPCE yields better HDIC performances on the selected data sets in comparison with eight other HDIC algorithms. This demonstrates that OPCE is a viable algorithm for HDIC problems.
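The first step depends on how well pairwise distances survive the reduction. As a rough illustration only, and not the authors' implementation, the sketch below computes a normalised stress value between an original sample set and a low-dimensional embedding of it. The function names `pairwise_distances` and `stress`, the normalisation by the original distances, and the toy data are all assumptions made for this example.

```python
import math
from itertools import combinations


def pairwise_distances(points):
    """Euclidean distance for every unordered pair of samples, in a fixed order."""
    return [math.dist(a, b) for a, b in combinations(points, 2)]


def stress(original, embedded):
    """Normalised stress: 0.0 means every pairwise distance is preserved exactly.

    Assumption for this sketch: the squared distance errors are normalised
    by the sum of squared distances in the original space.
    """
    d_high = pairwise_distances(original)
    d_low = pairwise_distances(embedded)
    num = sum((h - l) ** 2 for h, l in zip(d_high, d_low))
    den = sum(h ** 2 for h in d_high)
    return math.sqrt(num / den)


# Hypothetical toy data: three 3-D samples that all lie in the plane z = 0.
original = [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0), (0.0, 4.0, 0.0)]

# Dropping the constant z coordinate keeps every pairwise distance intact.
embedded_2d = [(0.0, 0.0), (3.0, 0.0), (0.0, 4.0)]

# Keeping only x collapses the first and third samples onto each other.
embedded_1d = [(0.0,), (3.0,), (0.0,)]

print(stress(original, embedded_2d))
print(stress(original, embedded_1d))
```

A lower value indicates that the embedding keeps the samples' relative geometry, which is the property the abstract asks of the MDS step before any observation points are placed.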

Related articles

Ensemble Classifier for Benign-Malignant Mass Classification

Mammography is currently the most effective imaging modality for early detection of breast cancer. In a CAD system for masses based on mammography, a mammogram is segmented to detect the masses. The segmentation gives rise to mass regions of interest (ROIs), which are either benign or malignant. There is a need to classify the extracted mass ROIs into benign and malignant masses; it is a hard...

Classifier Ensemble for Uncertain Data Stream Classification

Currently available algorithms for data stream classification are all designed to handle precise data, while data with uncertainty or imperfection is quite natural and widely seen in real-life applications. Uncertainty can arise in attribute values as well as in class values. In this paper, we focus on the classification of streaming data that has different degrees of uncertainty within class v...

Ensemble Approach for the Classification of Imbalanced Data

Ensembles are often capable of greater prediction accuracy than any of their individual members. As a consequence of the diversity between individual base-learners, an ensemble will not suffer from overfitting. On the other hand, in many cases we are dealing with imbalanced data, and a classifier built using all of the data has a tendency to ignore the minority class. As a solution to the problem, ...

Ensemble Learning for Imbalanced E-commerce Transaction Anomaly Classification

This paper presents the main results of our on-going work, one month before the deadline, on the 2009 UC San Diego data mining contest. The tasks of the contest are to rank the samples in two e-commerce transaction anomaly datasets according to the probability each sample has a positive label. The performance is evaluated by the lift at 20% on the probability of the two datasets. A main difficu...

Weighted Neighborhood Classifier for the Classification of Imbalanced Tumor Dataset

Machine learning is widely applied to gene expression profile-based molecular tumor classification, but the sample imbalance problem is often overlooked. This paper proposed a subclass-weighted neighborhood classifier to address the imbalanced sample set problem and a novel neighborhood rough set model to select informative genes for classification performance improvement. Experiments on three publicly...

Journal

Journal title: CAAI Transactions on Intelligence Technology

Year: 2022

ISSN: 2468-2322, 2468-6557

DOI: https://doi.org/10.1049/cit2.12100